This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 



BEST AVAILABLE IMAGES 



Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 



BLACK BORDERS 

TEXT CUT OFF AT TOP, BOTTOM OR SIDES 
FADED TEXT 
ILLEGIBLE TEXT 
SKEWED/SLANTED IMAGES 
COLORED PHOTOS 

BLACK OR VERY BLACK AND WHITE DARK PHOTOS 
GRAY SCALE DOCUMENTS 



IMAGES ARE BEST AVAILABLE COPY. 

As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 



L 



B24 



3 



Europaisches Patents mt 
European Patent Office 
Office europeen des brevets 



(Jj) Publication number: 



O 256 843 

A1 



EUROPEAN PATENT APPLICATION 



© Application number: B7307114.6 
© Date of filing: 11.08.87 



© IntCI. 3 : C 12 N 15/00 

C 12 P 21/02 



® Priority: 11.08.86 US 895154 
18.11.86 US 932037 



© Date of publication of application: 
24.02.88 Bulletin 88/8 

© Designated Contracting States: 

AT BE CH DE ES FR GB GR IT U LU NL SE 



©Applicant: CETUS CORPORATION 
1400 Fifty-Third Street 
Emeryville California 94608 (US) 

© Inventor: Devlin, James Joseph 
522 Silverado Drive 
Lafayette California 94549(USJ 

© Inventor: Devlin, Patricia Ego n 
522 Silverado Drive 
Lafayette California 94649(US) 

© Inventor: Kawasaki, Ernest Seigo 
260 San Mateo Street 
Richmond California 94804(US) 

© Inventor: Warren, Mary Kim 
486 Joaquin Avenue 
San Leandro California 94677(US) 

Representative: Biztey, Richard Edward et al, 
BOULT, WADE & TENNANT27 Fumh/al Street 
London EC4A 1PQ(GB) 



00 

to 
in 

CM 



a. 

UJ 



Expression of g-csf and muteins thereof and their use. 

® Human recombinant G-CSF (hr-G-CSF), an mRNA frac- 
tion encoding hr-G-CSF, a DNA sequence encoding hr-G-CSF, 
modified DNA sequences encoding hr-G-CSF, that are ex- 
pressed at high levels in recombinant hosts are disclosed. 
Modifications to the 5' DNA sequence encoding mature mG- 
CSF ere also disclosed. The modifications to the third posi- 
tion nucleotide of at least one in the 5' region codon of the 
DNA sequence of G-CSF result in enhanced expression of 
mG-CSF protein in recombinant host cells, without changing 
the amino acid sequence of the protein. Other modifications 
yield muteins of G-CSF in which the NH 2 -terminal amino acid 
is not methionine when produced from a recombinant host 
cell. Expression vectors carrying the altered mG-CSF DNA 
sequence are disclosed and claimed. 
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EXPRESSION OF G-CSF AND MUTEINS THEREOF AND THEIR USE 

The Invention concerns the field of recombinant DMA. In 
particular, the invention concerns the isolation of nucleic add 
sequences encoding granulocyte colony-stimulating factor (G-CSF) and 
5 the expression of G-CSF in various host cells. The invention concerns 
modified DNA sequences encoding G-CSF and muteins thereof that are 
efficiently expressed under the control of promoters operable in E. 
coll, expression vectors utilizing the modified DNA sequences for the 
efficient expression of G-CSF in E. coli and Gram-negative hosts known 
10 to exchange genetic material with E. coli, host organisms including 
microorganisms transformed with the expression vectors and G-CSF 
produced thereby. 

Interest in the growth and differentiation of granulocytes 
and macrophages from bone marrow progenitors has prompted the study of 

15 the colony-stimulating factors {CSFs) that regulate these processes . 
The subject has been recently reviewed by D. Metcalf in Blood , 67:257 
(1986). One of these factors, termed granulocyte-CSF (G-CSF) 
stimulates the in vitro growth of predominantly granulocyte colonies 
from bone marrow stem cells. The availability of cell lines that 

20 produce CSFs has been instrumental in the characterization of G-CSF; 
however, proper interpretation of this work requires the precise 
identification of the CSF released by these cells. One such cell 
line, the MIA PaCa-2, derived from a human pancreatic carcinoma, 
reportedly produces both the macrophage-spectfic CSF-1 and a CSF that 

25 can induce the formation of primarily granulocyte colonies in seven 
day bone marrow cell cultures. Wu, M. C, J, Clin. Invest., 67^:1588 
(1981). A CSF-1 cDNA clone has been recently isolated from this cell 
line, Kawasaki et al.» Science, 230:291 (1985). They also showed, by 
translation of sucrose gradient fractionated mRNA In Xenopus laevis 

30 oocytes, that a transcript that sedlmented slower than the 18s marker 
encoded a CSF that Induced the proliferation of murine bone marrow 
cells; however, the CSF-1 clone hybridized most strongly to mRNA 1n 
the fractions that sedlmented faster than the 18s marker. 
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Human G-CSF causes the differentiation of granulocytes and 
monocytes from bone marrow progenitor cells. It also Induces some 
leukemia cells to differentiate Into more mature cell types. Thus, 
human G-CSF may be useful 1n regeneration of Important bipod cell 
populations 1n Immunologically compromised patients and patients with 
anemic disorders or Infections. G-CSF may also be useful 1n 
inhibiting leukemia by causing leukemia cells to differentiate to a 
non-dividing state, thereby checking the uncontrolled proliferation of 
these cell types. Myeloid leukemia cells are known to have receptors 
for G-CSF, for example. Science, 323 :7 (1986). 

Souza et al . have isolated, cloned, and expressed a gene 
encoding G-CSF, which has no significant homology to GM-CSF, in an 
unidentified strain of E. coli using a modified temperature sensitive 
runaway plasmld with a x phage PL promoter and synthetic DNA fragment 
containing an initiation codon, followed by the sequence encoding the 
mature form of unaltered G-CSF. No data on expression levels are, 
however, disclosed. Souza et al., Science . 232:61-65 (1986). 

In addition, a colony stimulating factor having the same N- 
terminal 20 amino acids as that of Souza et al . was isolated from a 
cultured human oral carcinoma as disclosed in European Patent 
Publication 0169,566 to Nagata et al . This clone designated CHU-2, 
has an additional sequence encoding Val-Ser-Glu just prior to the 
cy$35 residue of the protein encoded by the DNA, sequence cloned from 
MIA PaCa-2, as described herein below. No information is given in 
either of the above-mentioned disclosures as to the levels of 
expression in recombinant hosts, particulaly E. coli . Nagata et al . 
subsequently reported a cDNA sequence that lacks the sequence encoding 
the above-mentioned additional tri-peptide sequence. Nagata et al., 
EMBO Journal 5:3 , 575-581 (1986). 

Reportedly, both of these recombinantly produced G-CSFs have 
NH 2 terminal methionines as a result of expression in the recombinant 
bacterial host E. coll . Such Nonterminal methionines are generally 
known to be immunogenic when the protein is administered parenterally 
to a mammalian host. 
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A number of factors are Involved 1n obtaining maximal 
expression of a protein 1n a recombinant host. Among these factors 
are optimization of the physiological conditions for growth of the 
host cell - by changes to the culture media in which the host grows. 
5 Also important is the selection and use of strong promoter and 
ribosome binding sites (RBS) that are operable 1n the expression 
vector to be used. The elimination of attenuator regions if they 
. occur in the region of regulated promoters such as the tryptophan 
operon is also desirable. 

10 A more subtle factor in maximizing protein expression in a 

recombinant host relates to codon utilization. In brief, codon 
utilization may be described as the ability of a particular 
recombinant host organism to translate some codons encoding a 
particular amino add more quickly and accurately than other different 

15 codons encoding the same amino acid. Thus, although the trinucleotide 
code specifying a particular amino acid is degenerate, I.e., more than 
one trinucleotide sequence can encode a particular amino acid, all 
codons encoding a particular amino acid are not utilized by the host 
with equal efficiency. Enhanced expression may be obtained by 

20 changing the nucleotide sequence of a gene in a recombinant host to 
the host preferred codons without changing the amino acid sequence of 
the protein encoded thereby. Additionally, changes in the amino acid 
sequence of the NH 2 terminus may effect processing of the NH 2 -term1nal 
methionine mentioned above. 

25 Another subtlety in the enhancement of gene expression in an 

expression vector in a recombinant host relates to secondary structure 
formation in the mRNA sequence to be expressed. It has been shown, 
for example, that certain "stem and loop" secondary structures placed 
in proximity to the 3' terminus of a gene, lead to increased stability 

30 of the mRNA transcript and Increased expression of the protein encoded 
thereby, Wong et al., Proc. Wat. Acad. Scl. (USA), 83:10, 32-33 
(ig86). However, it has been suggested that formation of stable 
secondary structures at the 5' end of an mRNA transcript may block 
efficient translation thereof, leading to reduced yield of the protein 

35 encoded by the gene. The Influence of mRNA secondary structure on 



0256843 

4 

translation has been discussed 1n Iserentant A F1 ers. Gene , 9:1-12 
(1980) and Buell et al., Nucleic Add Research. 13: 1923-1938 (1985). 
DeLamarter, J. F., et al., EMBO Journal , 4,10 ;2575-2581 (1985) have 
reported that the yield of GM-CSF has been Increased 1n E. coll by 
means of altering third position deoxyrlbocytoslne to 
deoxyrlboadenlnes 1n the first ten of the 5 1 codons of the DNA 
encoding mature GM-CSF, and attribute the Increase 1n protein 
synthesis to the altered 5' end of the mRNA transcript* 

It would be desirable to obtain large amounts of G-CSF for 
development of therapies utilizing the granulocyte stimulating and 
granulocyte-macrophage stimulating characteristics of this colony 
stimulating factor. Furthermore, it would be desirable to obtain 
forms of G-CSF that lack Nonterminal methionine when expressed 1n the 
recombinant host, and particularly 1n E. coli. 

The present invention makes available G-CSF in large 
quantities by producing 1t at high levels in a number of hosts. 

The invention discloses that a previously Isolated mRNA 
fraction from MIA PaCa-2 cells having high bone marrow proliferative 
activity encodes G-CSF. The invention encompasses the G-CSF mRNA. In 
addition, the Invention encompasses a cDNA sequence made from this 
mRNA fraction that encodes the G-CSF gene. 

The Invention also includes DNA sequences encoding mature G- 
CSF; these DNA sequences have been altered to facilitate expression of 
G-CSF in recombinant hosts. In addition, DNA sequence encoding mature 
G^-CSF or a mutein thereof lacking NH 2 -terminal methionine when 
expressed in a recombinant host, are disclosed. 

At present, investigators have encountered great difficulty 
in obtaining significant expression of mature G-CSF in E. coll hosts 
under P L promoter gene N-RBS control or tryptophan (Trp) promoter 
control when the unaltered nucleotide sequence of the mature G-CSF Is 
used. The inventors have determined that, when under ? L gene N-RBS 
control in E^ coli, steady state levels of mRNA encoding G-CSF are 
detectable by Northern analysis as described further herein. Under 
the control of the Trp promoter, no steady state G-CSF mRNA is found 
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1n E. coll by the same method. Surprisingly, in light of the work of 
Delemarter, et al . which suggests that alterations of the 5' end of 
the GM-CSF gene affects secondary structure of the GM-CSF mRNA 
sequence and the translation thereof, the Inventors have found that 
5 alterations in the 5 1 region of the G-CSF gene leads not only to 
increased expression of G-CSF protein in both Pj_ gene N-R8S and Trp 
controlled expression vectors in E. coli, it also leads to increased 
levels of mRNA steady state transcript in Trp controlled expression 
vectors in E. coll . 

10 Figure 1A is a plot of bone marrow proliferation activity 

obtained for Xenopus oocyte supernatants made from the fractionated 
MIA PaCa-2 mRNA. 

Figure IB is a dot blot of MIA PaCa-2 mRNA fractions with a 
CSF-l-specific probe. 

15 Figure 2 is a Northern blot of the RNA obtained from MIA 

PaCa-2 cells {lane 1), LD-1 cells (lane 2), and 5637 cells (lane 3), 
probed with two different 24-mer probes. In 2A the probe was 5'- 
ATGGCTGGACCTGCCACCCACAGC-3', which hybridized to RNA of all three 
lanes. In 2B the oligomer probe was 5 1 -AGAAGCT GGTGAGTGAGT GTGCCA-3 ' . 

20 The nine nucleotides underlined appear only 1n the CHU-2 sequence that 
purports to be G-CSF (Nagata et al . , supra) . This probe did not 
hybridize to any of the cell lines tested under stringent conditions. 

Figure 3A is a plot of the bone marrow proliferation 
activity of the supernatants collected from Xenopus laevis oocytes 40 
25 hours after injection of the pooled MIA PaCa-2 RNA fractions. 
Activity was measured by the murine bone marrow cell proliferation 
assay described 1n the examples below. 

Figure 3B is a Northern blot of the pooled MIA PaCa-2 mRNA 
fractions using a G-CSF specific y 32 P labeled oligodeoxyrlbonucleotlde 
30 probe. 

Figure 4 shows the complete sequence of G-CSF cDNA obtained 
from the sequencing of the BamH I digest of plasmid pP12 and deduced 
amino add sequence encoded thereby. The arrow indicates the position 
of the nine nucleotide sequence GTGATGGAG of the Nagata et al. clone. 
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Figure 5 1s a schematic Illustration showing the prepartlon 
of plasmld pPDl and the placement of a Hind i II restriction site and 
ATG codon 1n the G-CSF sequence. 

Figure 6 Is a schematic Illustration of the prepartlon of 
5 plasmld pPD2 1n which the 3* untranslated region of the G-CSF gene 1s 
excised and transcription termination and mRNA stability are placed 
under control of the positive retroregulatory element. 

Figure 7 is a schematic Illustration of the preparation of 
plasmid pJDl in which G-CSF expression 1s under the control of the P L 
10 promoter and gene N-ribosome binding site. 

Figure 8 1s a schematic illustration of the preparation of 
pJD4, pJD4A and pJD4B. 

Figure 9 1s a schematic illustration of the preparation of 

pPD5. 

15 Figure 10 is a Northern blot of mRNA from pJDl and pPD2 

probed with a G-CSF specific oligodeoxy nucleotide probe. 

Figure 11 shows the DNA sequence and N-terminal amino acid 
sequence encoded by native G-CSF, pJD4B, pPD5 and pJD4A. The ATG 
codon encoding Met was added to native G-CSF and does not occur 1n the 
20 cDNA sequence obtained from MIA PaCa-2 transcripts. 

Figure 12 is an SDS gel showing expression of G-CSF in 
induced and uninduced E. coli host carrying pJD4A and pJD4B. 

Figure 13 is an SDS gel showing expression of G-CSF in 
induced and uninduced E. coli host carrying pPD5. 

25 Figure 14 is a schematic illustration of the preparation of 

plasmid pPD6 in which v^-CSF expression is under control of the P L 
promoter and gene N-ribosorae binding site. 

Figure 15 is an SDS gel showing expression of mG-CSF and 
mG-CSF. The Vj-mG-CSF mobilily reflects a slightly smaller size 
30 consistant with deletion of NH 2 -terminal Met and Thr. 
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A, G-CSF Cloning and Expression 

The G-CSF of the present invention was detected in a 
purified mRNA fraction obtained by sucrose density gradient 
centrifugation of induced MIA PaCa-2 cells as described in PCT 
5 Publication No- W086/04607 published August 14, 1986 and assigned to 
the assignee of the present invention. 

mRNA fractions obtained from the gradient were injected into 
Xenopus laevis oocytes; the oocytes were Incubated, and the 
supernatants tested for the ability to stimulate murine bone marrow 

10 proliferation (bmp). Peak bmp activity was found in supernatants from 
oocytes that had been injected with the 14, 15 and 16S MIA PaCa-2 mRNA 
fractions although peak hybridization to a CSF-1 specific probe was 
obtained with the 18S mRNA fraction. This phenominon is illustrated 
in Figure 1A which shows a plot of bmp activity and CSF hybridization 

15 intensity per mRNA fraction. 

Further, investigation by the present Inventors has 
determined that the peak bmp activity associated with the slower 
sedimentlng RNA fractions is due to the activity of G-CSF as 1s shown 
in Figure 3A and 3B. These figures show that peak hybridization of a 
20 G-CSF specific probe with MIA PaCa-2 mRNA fractions sediraenting slower 
than 18S. Thus, the present invention encompasses an mRNA fraction of 
human cells that 1s capable of producing G-CSF 1n a host cell. 

"G-CSF" as used herein means a protein having the effect of 
stimulating the production of primarily granulocyte colonies or 
25 granulocyte-macrophage colonies in a colony forming assay using bone 
marrow cell progenitors of an appropriate species. A protein having 
this activity has the deduced amino acid sequence shown in Figure 4 
herein and is considered to be within the scope of the invention. 

G-CSF according to the invention may be isolated from the 
30 MIA PaCa-2 cell line. In addition, as shown in Figure 2, G-CSF 
specific RNA sequences are clearly detectable in the mRNA of LD-1 
cells and 5637 cells. Thus, G-CSF obtained by cloning from these cell 
lines as well as any other cell line which 1s G-CSF probe positive, is 
considered within the scope of the invention. Mature G-CSF produced 
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by the LD-1 cell line has been Isolated and amino-acid sequenced. 65% 
of the LD-1 produced protein has a thr residue at the NH 2 -term1nus and 
35% has a pro residue at the NH 2 -term1nus. 

As 1s the case for all proteins, the precise chemical 

5 structure of G-CSF depends on a number of factors. As lonizable amino 
and carboxyl groups are present 1n the molecule, a particular protein 
may be obtained as an acidic or basic salt, or 1n neutral form. All 
such prepartlons which retain their activity when placed in suitable 
environmental conditions are included in the definition. Further, the 

10 primary amino acid sequence may be augmented by derivatization using 
sugar moieties (glycosylatlon) or by other supplementary molecules 
such as lipids, phosphate, acetyl groups and the like, more commonly 
by conjugation with saccharides* The primary amino acid structure may 
also aggregate to form complexes. Certain aspects of such 

15 augmentation are accomplished through post- translation! processing 
systems of the producing host; other such modification may be 
introduced in vitro . In any event, such modifications are Included in 
the definition so long as the activity of the protein, as defined 
above, is not destroyed. It is expected, of course, that such 

20 modifications may quantitatively or qualitatively affect the activity, 
either by enhancing or diminishing the activity of the protein in the 
various assays. 

Further, Individual amino add residues in the chain may be 
modified by oxidation, reduction, or other derivatization, and the 
25 protein may be cleaved to obtain fragments which retain activity. 
Such alterations which do not destroy activity do not remove the 
protein sequence from the definition. 

Modifications to the primary structure itself by deletion, 
addition, or alteration of the amino acids incorporated into the 
30 sequence during translation can be made without destroying the 
activity of the protein. Such substitutions or other alterations 
result in proteins having an amino acid sequence which falls within 
the definition of proteins "having an amino acid sequence 
substantially equivalent to that of G-CSF". 
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For convenience, the mature G-CSF protein amino acid 
sequence shown in Figure 4, deduced from the cDNA clone illustrated 
herein, is designated mG-CSF (mature G-CSF) beginning at the amino 
acid residue threonine designated +1. Figure 4 shows the presence of 
5 a 30 residue putative signal sequence, which is presumably cleaved 
upon secretion from mammalian cells; mG-CSF is represented by amino 
acids 1-174 shown in that figure. Specifically included in the 
definition of human G-CSF are muteins which monomers and dimers, if 
any, are G-CSF and related forms of G-CSF, designated by their 
10 differences from mG-CSF. G-CSF derived from other species may fit the 
definition of "human" G-CSF by virtue of its display of the requisite 
pattern of activity as set forth above with regard to human substrate. 

Also for convenience, the amino acid sequence of G-CSF will 
be used as a reference and other sequences which are substantially 

15 equivalent to this in terms of G-CSF activity will be designated by 
referring to the sequence shown in Figure 4. The substitution of a 
particular amino acid will be noted by reference to the number of the 
amino acid residue which it replaced. Thus, for example, ser 60 G-CSF 
refers to the protein which has the sequence shown in Figure 4 except 

20 that the amino acid at position 60 is serine rather than proline. 
Deletions are noted by a v followed by the numer of amino acids 
deleted from the N-terminal sequence, or by the number of amino acids 
remaining when residues are deleted from the C-terminal sequence, when 
the number is followed by a minus sign. Thus, v^-G-CSF refers to G- 

25 CSF of Figure 4 wherein the first four amino acids from the N-terminus 
have been deleted; v 130 - refers to G-CSF wherein the last 44 amino 
acids following amino acid 130 have been deleted. 

"Operably linked" or "1n operable linkage" refers to 
juxtaposition such that the normal function of the components can be 
30 performed. Thus, a coding sequence "operably linked" to control 
sequences refers to a configuration wherein the coding sequence can be 
expressed under the control of these sequences. 

"Control sequences" refers to DNA sequences necessary for 
the expresion of an operably linked coding sequence in a particular 
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host organism.. The control sequences which are suitable for 
procaryotes, for example, Include a promoter, optionally an operator 
sequence, a RBS, and possibly, other as yet poorly understood, 
sequences. Eucaryotic cells are ; known to . utilize promoters, 
5 polyadenylation signals, and enhancers. 

"Expression system 11 refers to DNA sequences containing a 
desired coding sequence and control sequences in operable linkage, so 
that hosts transformed with these sequences are capable of producing* 
the encoded proteins. In order to effect transformation, the 
10 expression system may be included on a vector; however, the relevant 
DNA may then also be integrated into the host chromosome* 

As used herein "cell", "cell line", and "cell culture" are 
used interchangeable and all such designations incl ude. progeny. Thus 
"transformants" or "transformed, cells" includes the primary subject 

15 cell and cultures derived therefrom without regard for the number. of 
transfers. It is also understood that all progeny may not be 
precisely identical in DNA content, due to deliberate or inadvertent 
mutations. Mutant- progeny which have the same functionality as 
screened for in the originally transformed cell , are included. Where 

20 distinct designations are Intended, it will be clear from the context. 

Control Sequences and Corresponding Hosts 

Procaryotes most frequently are represented by various 
strains of E. coll . However* other microbial, strains may also be 
used, such as bacilli, for example Bacillus subtilis , various species 

25 of Pseudomonas , or other bacterial strains. In such procaryotic 
systems , pi a smi d vectors which contain replication sites and control 
sequences derived from a species compatible with the host are used. 
For example, E. coli is typically transformed using derivatives of 
pBR322, a plasmid derived from an E. coli species by Bolivar, et al . , 

30 Gene , 2^:95 (1977). pBR322 contains genes for ampicillin and 
tetracycline resistance, and thus provides additional markers which 
can be either retained or destroyed in' constructing the desired 
vector. Commonly used procaryotic control sequences which are defined 
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therein to Include promoters for transcription Initiation, optionally 
with an operator, along with RBS sequences, include such commonly used 
promoters as the beta-lactamase (penicillinase) and lactose (lac) 
promoter systems (Chang, et al., Nature (1977) 198 :1056) and the 
5 tryptophan (Trp) promoter system (Goeddel, et al., Nucleic Adds Res. 
(1980) 8_:4057) and the lambda derived P L promoter and N-gene ribosome 
binding site (Shlmatake, et al,, Nature (1981) 292:128), which has 
been made useful as a portable control cassette, as set forth in PCT 
Publication No. W085/03522 published August 15, 1985, However, any 
10 available promoter system compatible with procaryotes can be used. 

As described 1n detail hereinbelow, the native G-CSF 
sequence has not proved to be effective for the production of G-CSF 1n 
such prokaryotlc microbial hosts at high levels. In the present 
invention, alterations have been carried out to the 5* end of the mG- 
15 CSF coding sequence. In addition to placing an ATG start codon at the 
5 1 end of the DNA sequence encoding mG-CSF, a number of changes have 
been made in the nucleotide sequence thereof without changing the 
amino acid sequence of the mG-CSF protein encoded thereby. 

These nucleotide changes encompass codons primarily encoding 
20 the first 10 amino acids of mG-CSF. At least one nucleotide 1n the 
third position of at least one of these codons has been changed from a 
deoxyriboguanldine or deoxyribocytoslne to a deoxyrlboadenlne. These 
changes are made without altering the amino acid residue encoded by 
the codon, and furthermore, the resulting altered codon 1s one which 
25 is not necessarily host-preferred, at least when the host cell is E. 
col i . More preferably, the third position codon alteration Is carried 
out on at least one of the first 10 or 11 codons of mG-CSF wherein the 
5' amino acid sequence of mG-CSF remains 

ThrProLeuGlyProAlaSerSerLeuPro or MetThrProLeuGlyProAlaSerSerLeuPro, 
30 when the additional codon encoding an N- terminal Met 1s Included. 

More preferably, the above-described G to A or C to A codon 
alterations are carried out on at least one of the first 4 or 5 codons 
of mG-CSF wherein the 5' amino add sequence of mG-CSF remains 
ThrProLeuGly or MetThrProLeuGly when the additional codon encoding an 
35 N-terminal methionine is Included. 
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In a more preferred embodiment, the above-mentioned changes 
to the third position nucleotide of codons encoding the 5 1 end of mG- 
CSF are carried out to at least three of the 10 codons of the DNA 
sequence encoding the amino acid sequence 

5 ThrProLeuGlyProAlaSerSerLeuPro. 

In an even more preferred embodiment, the above-mentioned 
changes to the third position nucleotide of codons encoding the 5* end 
of mG-CSF are carried out to at least three of the 4 codons of the DNA 
sequence encoding the amino acid sequence ThrProLeuGly. 

10 When the above-mentioned changes are made in the 5' end of 

the DNA sequence encoding mG-CSF that is In an E. coli compatible 
expression vector 1n operable linkage with the Trp promoter, both 
steady state mRNA transcripts encoding mG-CSF and a protein fraction 
having mG-CSF activity are readily detectable by Northern blotting and 

15 Xenopus laevis oocyte assay, bone marrow proliferation, or granulocyte 
colony stimulation assays, respectively. By contrast, the native mG- 
CSF DNA sequence in otherwise operable linkage with the Trp promoter 
yields neither mRNA transcript nor detectable by Northern blot nor a 
protein fraction having G-CSF activity in the above-mentioned assays. 

20 The native G-CSF DNA sequence 1n an expression vector in 

operable linkage with the P L promoter gene N-RBS, yields detectable 
steady state mRNA transcripts encoding G-CSF, but no protein fraction 
having mG-CSF. However, when the altered mG-CSF DNA sequence 
according to the invention 1s placed in the same vector under the same 

25 promoter RBS control, a protein fraction is produced having mG-CSF 
activity. 

The Invention also concerns muteins of G-CSF in which the N- 
termlnal methionine is not present when G-CSF 1s produced from a 
recombinant host. In prokaryotic recombinant hosts such as E. coll . 
30 the translation initiation codon ATG encodes the amino add 
methionine. As a result, the amino or N^-terminus of recombinant 
eukaryotic proteins produced in these recombinant hosts has a 
methionine residue that is generally not found in the protein when 
produced in the eukaryotic cell from which the DNA was initially 
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obtained. This superfluous NH 2 -termtnal methionine, however, can be 
processed and removed by certain exopeptidase enzymes,, most 
particularly methionine amino peptidase or met amino peptidase. This 
enzyme although capable of removing NH 2 -terminal amino acid residues, 
5 specifically does not remove the NH 2 -terminal methionine with equal 
efficiency from all proteins. In particular, the efficiency of the 
enzyme varies depending upon the Identity of the amino acid residue 
adjacent to the NH 2 -terminal methionine. 

The precise specificity of the enzyme in this regard cannot 
10 be determined without testing nearly an infinite number of substrates; 
however, a useful rule of ■ thumb is set forth by Sherman, F., et al., 
Bio Assays 3:27-31 (1985). Sherman et al. based their considerations 
for specificity of Met-amino-peptidases on the observed forms of 
mutants of i so-l-cytochrome-C from yeast and the published primary 
15 sequence of 82 mature intracellular proteins. They conclude that 
methionine is usually cleaved from residues with a side chain having a 
radius of gyration of 1.29 A or less, but generally not cleaved from 
residues with side chains larger than 1.43 A. This is consistent with 
the observation that mutationally altered iso-l-cytochrome-C taken in 
20 consideration with other published sequences of other proteins from 
procaryotic and eucaryotic systems indicate that N-terminal methionine 
is cleaved when it precedes residues of alanine, cysteine, glycine, 
proline, serine* threonine, or valine, but not when it precedes 
residues of arginine, asparagine, aspartic acid, glutamine, glutamic 
25 acid, isoleucine, leucine, lysine, or methionine. These results are 
generally consistent with those set forth in the illustrations 
below. However, some exceptions occur where the radius of gyration 
for the second amino acid is higher than 1.29 A and the secondary and 
tertiary structure or other conditions are particularly favorable. 
30 Therefore, this aspect of the specificity of the enzyme is intended as 
a general guideline and it should be borne 1n mind that even the 
specificity of particular amino peptidase have not been determined 
with exact precision, i.e., not all possible tertiary structures have 
been tested. Therefore, in order to fall within the definition of 
35 "Met-aminopeptidase" » the enzyme needs only to meet the requirement of 
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specific cleavage of N-term1na1 methionine without cleavage of 
internal methionine residues and without cleavage of N-terminal 
residues other than methionine from any peptide. 

Among the N^-terminal amino acid sequences that may be 
5 processed by amino terminal methionine peptidase, are those in which 
the methionine is followed by ala, gly or pro. In addition, NH 2 - 
- terminal methionine followed by ala-met, gly-met, ala-ser, gly-gly, 
ala-pro, or pro-thr is expected to be processed. 

Thus, any of the form of G-CSF or muteins of G-CSF in which 

10 the above-mentioned sequences comprise the N-terminal portion of the 
amino acid will fall within the definition of G-CSF according to the 
invention so long as the biological activity of G-CSF is maintained. 
In particular, a deletion mutant of G-CSF, in which the threonine 
residue following the N^-terminal methionine is deleted to yield a 

15 Vj-G-CSF 1s illustrated in some detail hereinbelow. Specifically 
mutated forms of G-CSF in which the thr^ residue is substituted with 
ala, or gly, ala-ser, gly-gly, or in which the order of pro and thr 
are reversed will also fall within the scope of the invention. DNA 
species encoding the above-indicated muteins of G-CSF may be made by 

20 site specific mutagenesis of the cDNA encoding native G-CSF using 
known techniques. Oligonucleotides encoding these muteins may also be 
substituted for DNA sequences encoding the native G-CSF amino terminal 
amino acid sequence through ligation of ol igodeoxyribonucleotides 
encoding the altered sequenced into specifically digested G-CSF DNA 

25 sequences. 

B. General Methods for Carrying out the Invention 
Transformations 

Depending on the host cell used, transformation is done 
using standard techniques appropriate to such cells. The calcium 
30 treatment employing calcium chloride, as described by Cohen, S. N. f 
Proc. Natl. Acad. Sci. (USA) (1972) 69:2110, or the RbCl 2 method 
described in Maniatls, et a!., Molecular Cloning; A Laboratory Manual 
(1982) Cold Spring Harbor Press, p. 254 was used for procaryotes or 
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other cells which contain substantial cell wall barriers. For 
mammalian cells without such cell walls, the calcium phosphate 
precipitation method of Graham and van der Eb, Virology (1978) 52:546 
may be used or the method of Wang, et al., Science, 228 :149 (1985). 
5 Transformations into yeast are carried out according to the method of 
Van Solingen, P., et al., J. Bact. (1977) 130:946 and Hsiao, C. L., et 
al., Proc. Natl, Acad. Sci. (USA) (1979 ) 76: 3829, 

Screening cDNA Libraries 

cDNA libraries are screened using the colony hybridization 
10 procedure. Lifts of colonies are made onto nitrocellulose filter 
papers (S & S type BA-85). The colonies are lysed and DMA fixed to 
the filter by treatment for five minutes with 0.5 M NaOH, 1.5 M NaCl , 
and are then washed twice for five minutes each time with 1.0 M Tris 
pH 8, 3 M NaCl. Filters are air dried and baked at 80°C for two 
15 hours. The duplicate filters are prehybridized at 45-50°C for one 
hour in 5 x SSC, 10 x Denhardt's solution (0.2% polyvinylpyrrolidone, 
0.2% Ficoll, 0.2% BSA), 0-1% SDS, 50 irM sodium phosphate pH 7.0, and 
100 pg/ml tRNA. 

Hybridization of the filters is done in a solution similar 
20 to that described above for prehybridization, but also contains 10% 
dextran sulfate with kinased probe at 1 x 10 6 CPM/ml under conditions 
which depend on the stringency desired. Typical moderately stringent 
conditions employ temperatures of 45-50°C for 16-20 hours with 1-5 
ml /filter of DNA hybridization buffer containing probe. For higher 
25 stringencies higher temperatures are employed. The filters are washed 
three times for 15 minutes each time at appropriate temperatures using 
3 x SSC, 0.1% SDS, air dried, and are autoradiographed at -70°C for 
two to three days. 

Vector Construction 

30 Construction of suitable vectors containing the desired 

coding and control sequences employs standard ligation and restriction 
techniques which are well understood in the art. Isolated plasmids, 
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DNA sequences, or synthesized oligonucleotides are cleaved, tailored, 
and rellgated in the form desired. 

Site specific DNA cleavage 1s performed by treating with the 
suitable restriction enzyme (or enzymes) under conditions which are 
5 generally understood in the art, and the particulars of which are 
specified by the manufacturer of these commercialy available 
restriction enzymes. See, e.g., New England Biolabs, Product 
Catalog. In general, about 1 yg of plasmid or DNA sequence is cleaved 
by one unit of enzyme 1n about 20 pi of buffer solutions- In the 
10 examples herein, typically, a 3-10 fold excess of restriction enzyme 
is used to insure complete digestion of the DNA substrate. Incubation 
times of about one hour to two hours at 37°C or other appropriate 
temperatures are workable, although variations can be tolerated. 
After each incubation, protein is removed by extraction with 
15 phenol /chloroform, and may be followed by ether extraction, and the 
nucleic acid recovered from aqueous fractions by precipitation with 
ethanol followed by running over a Sephadex 6-50 spin column. If 
desired, size separation of the cleaved fragments may be performed by 
polyacyl amide gel or agarose gel electrophoresis using standard 
20 techniques. A general description of size separations 1s found in 
Methods in Enzymology (1980) 65:499-560 or Maniatis, Molecular 
Cloning: A Laboratory Manual , Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY (1982). 

Restriction cleaved fragments may be blunt ended by treating 
25 with the large fragment of E. coli DNA polymerase I (Klenow) 1n the 
presence of the four deoxynucleotide triphosphates (dXTPs) using 
incubation times of about 15 to 25 minutes at 20 to 25°C in 50 mM Tris 
pH 7.6, 50 mM NaCl , 6 mM MgCl 2 , 6 mM DTT and 5-10 yM dXTPs. The 
Klenow fragment fills In at 5' overhangs but chews back protruding 3' 
30 single strands, even though the four dXTPs are present. If desired, 
selective repair can be performed by supplying only one of the, or 
selected, dXTPs within the limitations dictated by the nature of the 
sticky ends. After treatment with Klenow, the mixture is extracted 
with phenol /chloroform and ethanol precipitated followed by running 
35 over a Sephadex G-50 spin column. Treatment under appropriate 
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conditions with SI nuclease results 1n hydrolysis of any single- 
stranded portion. 

Synthetic oligonucleotides are prepared by ' the triester 
method of Matteucci, et al. ( J. Am. Chem. Soc. (1981) 103 :3185) or 

5 using commercially available automated oligonucleotide synthesizers. 
Kinasing of single strands prior to annealing or for labeling is 
achieved using an excess, e.g., approximately 10 units of 
polynucleotide kinase to 0.1 nmole substrate in the presence of 50 mM 
Tris, pH 7.6, 10 nM MgCl 2 > 5 mM di thiothrei tol , 1-2 mM ATP, 1.7 pmoles 

10 T-32P-ATP (2.9 mCi/mmole), 0.1 mM spermidine, 0.1 mM EDTA. 

Ligations are performed in 15-30 yl volumes under the 
following standard conditions and temperatures: 20 mM Tris-Cl pH 7.5, 
10 mM MgCl 2 » 10 nrt DTT, 33 yg/ml BSA, 10 mM-50 mM NaCl , and either 40 
vM ATP, 0.01-0.02 (Weiss) units T4 DMA ligase at 0°C (for "sticky end 1 ' 

15 ligation) or 1 mM ATP, 0.3-0.6 (Weiss) units T4 DNA ligase at 14°C 
{for "blunt end" ligation). Intermolecul ar "sticky end" ligations are 
usually performed at 33-100 yg/ml total DNA concentrations (5-100 nM 
total end concentration). Intermolecul ar "blunt end" ligations 
(usually employing a 10-30 fold molar excess of linkers) are performed 

20 at 1 yM total ends concentration. 

In vector construction employing "vector fragments", the 
vector fragment is commonly treated with bacterial alkaline 
phosphatase (BAP) in order to remove the 5 1 phosphate and prevent 
religation of the vector. BAP digestions are conducted at pH 8 in 

25 approximately 150 mM Tris, in the presence of Na + and Mg +2 using about 
1 unit of BAP per yg of vector at 60°C for about one hour. Vector 
fragments subjected to this treatment are referred to herein as 
"bapped". In order to recover the nucleic acid fragments, the 
preparation is extracted with phenol /chloroform and ethanol 

30 precipitated and desalted by application to a Sephadex 6-50 spin 
column. Alternatively, religation can be prevented in vectors which 
have been double digested by additional restriction enzyme digestion 
of the unwanted fragments. 
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For portions of vectors derived from cDNA or genomic DNA 
which require sequence modifications, site specific primer directed 
mutagenesis is used. This is conducted using a synthetic 
oligonucleotide primer complementary to a single stranded phage DNA to 

5 be mutagenized except for limited mismatching, representing the 
desired mutation. Briefly, the synthetic oligonucleotide is used as a 
primer to direct synthesis of a strand complementary to the phage, and 
the resulting double-stranded DNA is transformed into a phage- 
supporting host bacterium. Cultures of the transformed bacteria are 

10 plated in top agar, permitting plaque formation from single cells 
which harbor the phage. 

Theoretically, 50* of the new plaques will contain the phage 
having, as a single strand, the mutated form; 50% will have the 
original sequence. The resulting plaques are hybridized with kinased 

15 synthetic primer at a temperature which permits hybridizaton of an 
exact match, but at which the mismatches with the original strand are 
sufficient to prevent hybridization. Plaques which hybridize with the 
probe are then picked, cultured, and the DNA recovered. Details of 
site specific mutation procedures are described below in specific 

20 examples . 

Verification of Construction 

In the constructions set forth below, correct ligations for 
plasmid construction are confirmed by first transforming E. coli 
strain MM294 obtained from E. coli Genetic Stock Center, CGSC 6135, or 

25 other suitable host with the ligation mixture. Successful 
transformants are selected by ampicillin, tetracycline or other 
antibiotic resistance or using other markers depending on the mode of 
plasmid construction, as is understood in the art. Plasmids from the 
transformants are then prepared according to the method of Clewell, D. 

30 B., et a!., Proc. Natl. Acad. Sci. (USA) (1969) _62: 1159, optionally 
following chloramphenicol amplification (Clewell, D. B., J. Bacterid. 
(1972) 110 :667). The isolated DNA is analyzed by restriction enzyme 
mapping and/or sequenced by the dideoxy method of Sanger, F., et al., 
Proc. Natl. Acad. Sci. (USA) (1977) 74:5463 as further described by 
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Messing, et al . , Nucleic Adds Res. (1981) £:309, or by the method of 
Maxam, et al . , Methods In Enzymology { 1980) 65: 499* 

Hosts Exemplified 

Host strains used in cloning and expression herein are as 

5 f ol 1 ows : 

For cloning and sequencing, and for expression of 
construction under control of most bacterial promoters, E. coli strain 
MM294 (supra), Talmadge, K. , et al., Gene (1980) ^2:235; Messelson, 
M. , et -a'l., Nature (1968) 217 :1110, was used as the host. For 
10 expression under control of the P L N-RBS promoter, E. coli strain K12 
MC1000 lambda lysogen, N 7 N 5 3Cl857SusPgg, ATCC 39531 (hereinafter 
sometimes referred to as MC1000-39513 XDG95 or DG95) is used. 

For M13 phage recombinants, E. coli strains susceptible to 
phage infection, such as E . col i K12 strain DG98 are employed. The 
15 DG98 strains has been deposited with ATCC July 13, 1984 and has 
accession number 1965. 

In addition to bacteria, eucaryotic microbes, such as yeast, 
may also be used as hosts. Laboratory strains of Saccharomyces 
cerevisiae , Baker's yeast, are most used although a number of other 

20 strains are commonly available. While vectors employing the 2 micron 
origin of replication are illustrated, Broach, J. R., Meth. Enz. 
(1983) 101 :307, other plasmid vectors suitable for yeast expression 
are known (see, for example, Stinchcomb, et al., Nature (1979) 282 :39, 
Tschempe, et al., Gene (1980) 10:157 and Clarke, L. , et al., Meth. 

25 Enz. (1983) 101 :300). Control sequences for yeast vectors include 
promoters for the synthesis of glycolytic enzymes (Hess, et al., J. 
Adv . Enzyme Reg . (1968) 7_:149; Holland, et al., Biochemistry (1978) 
_17j4900). Additional promoters known in the art include the promoter 
for 3-phosphoglycerate kinase (Hitzman, et al., J. Biol. Chem. (1980) 

30 255 :2073), and those for other glycolytic enzymes, such as 
glyceraldehyde-3-phosphate dehydrogenase, hexosekinase, pyruvate 
decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3- 
phosphoglycerate mutase, pyruvate kinase, triosphosphate isomerase, 
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phosphoglucose Isomerase, and glucoklnase. Other promoters, which 
have the additional advantage of transcription controlled by growth 
conditions are the promoter regions for alcohol dehydrogenase 2, 
Isocytochrome C, acid phosphatase, degradatlve enzymes associated with 
5 nitrogen metabolism, and enzymes responsible for maltose and galactose 
utilization (Holland, Ibid). Alterations to the 5' end of the mG-CSF 
DNA sequence analogous to those shown to be effective with the Trp 
promoter and P L promoter may also be carried out for use with such 
eukaryotic promoters. 

10 It is also believed terminator sequences are desirable at 

the 3' end of the coding sequences. Such terminators are found in the 
3 1 untranslated region following the coding sequences in yeast-derived 
genes. Many of the vectors illustrated contain control sequences 
derived from the enolase gene containing plasmid peno46 (Holland, M. 

15 J., et al., J. Biol. Chem. (1981) 255:1385) or the LEU2 gene obtained 
from YEpl3 (Broach, J. , et al., Gene (1978)^8:121), however any vector 
containing a yeast compatible promoter, origin of repldatlon and 
other control sequences is suitable. 

It is also, of course, possible to express genes encoding 

20 polypeptides in eukaryotic host cell cultures derived from 
multicellular organisms as described further hereinbelow. See, 
generally for example, Tissue Culture , Academic Press, Cruz and 
Patterson, editors (1973). Useful host cell lines include murine 
myelomas N51, VERO and HeLa cells, COS-7 cells, and Chinese hamster 

25 ovary (CHO) cells. Expression vectors for such cells ordinarily 
include promoters and control sequences compatible with mammalian 
cells such as, for example, the commonly used early and late promoters 
from Simian Virus 40 (SV 40) (Fiers, et al., Nature (1978) 273:113), 
or other viral promoters such as those derived from polyoma, 

30 Adenovirus 2, bovine papiloma virus, or avian sarcoma viruses,... or 
immunoglobin promoters and heat shock promoters. General aspects of 
mammalian cell host system transformations have been described by 
Axel, U-S. Patent No. 4,399,216 issued August 16, 1983. It now 
appears, also that "enhancer" regions are important in optimizing 

35 expression; these are, generally, sequences found upstream of the 
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promoter region. Origins of replication may be obtained, if needed, 
from viral sources. However, integration Into the chromosome is a 
common mechanism for DNA replication in eucaryotes. Plant cells are 
also now available as hosts, and control sequences compatible with 
5 plant cells such as the nopal ine synthase promoter and 
polyandenylation signal sequences (Depicker,, A. t et al., j t Hoi . 
Appl. Gen. (1982)^:561) are available. 

The invention will be better illustrated in relationship to 
the following examples which are intended by the Inventors to be 
10 exemplary and non-limiting. 

Example I 

Isolation and Expression of cDNA Encoding Human G-CSF 

A cDNA clone encoding human granulocyte stimulating factor 
was isolated from the MIA PaCa-2 cell line, and was expressed using a 

15 recombinant vector in COS-7 cells. The MIA PaCa-2 cell line described 
in Yunls, A* A. et al., Experimental Hematol . 12:838-843 (1984), is an 
established cell line publlcally available from the Cell Repository 
Lines {CRD collection of the American Type Culture Collection, 12301 
Parklawn Avenue, Bethesda, MD 20895 under accession number ATCC CRL 

20 1420. The cDNA clone was sequenced and the corresponding amino add 
sequence was deduced. 

A - Initial Detection of mRNA From MIA PaCa-2 Cells Havin g High BMP 
Activity, But Low Hybridization to CSF-1 "Probe? 

The human derived pancreatic carcinoma cell line HIA PaCa-2 
25 was used as a source of mRNA to validate CSF-1 specific probes and for 
the formation of a cDNA library containing an intronless form of the 
human CSF-1 coding sequence- The MIA PaCa-2 cell line produces CSF-1 
at a level approximately 10 fold below that of the murine L-929 cells. 

Negative control mRNA was prepared from MIA PaCa-2 cells 

30 maintained 1n serum-free medium, i.e., under conditions wherein they 

do not produce CSF-1. Cells producing CSF-1 were obtained by 
relnducing CSF-1 production after removal of the serum. 
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Cells were grown to confluence 1n roller bottles using 
Dulbecco's Modified Eagles' Medium (DMEM) containing 10% fetal calf 
serum, and produce CSF-1 at 2000-6000. units/ml. The cell cultures 
were washed, and relncubated 1n serum-free condition to suppress CSF-1 
5 formation. For negative controls, no detectable CSF-1 was produced 
after a day or two of Incubation without serum. Reinduced cells were 
obtained by addition of phorbol nyHstic acetate (100 ng/ml ) to obtain 
production after several days of 1000-2000 units/ml. 

The MIA PaCa-2 mRNA was isolated by lysis of the cell 1n 
10 Isotonic buffer with 0.5% NP-40 1n the presence of ribonucleoside 
vanadyl complex (Berger, S. L, et al., Biochemistry (1979) _18:5143) 
followed by phenol /chl oroform extraction, ethanol precipitation, and 
oligo dT chromatography, and an enriched mRMA preparation obtained. 
In more detail, cells are washed twice in PBS (phosphate buffered 
15 saline) and are resuspended in IHB (140 mM NaCl , 10 mM Tris, 1.5 mM 
MgCl 2 » PH 8) containing 10 mM vanadyl adenosine complex (Berger, S. 
L. , et. al., supra ). 

A non-Ionic detergent of the ethylene oxide polymer type 
(NP-40) is added to 0.5% to lyse the cellular, but not nuclear 
20 membranes. Nuclei are removed by centrifugation at 1,000 x g for 10 
minutes. The post-nuclear supernatant is added to two volumes of TE 
(10 mM Tris, 1 mM ethyl enedi ami netetraace tic acid (EDTA), pH 7.5) 
saturated phenol chloroform (1:1) and adjusted to 0.5% sodium dodecyl 
sulfate (SDS) and 10 mM EDTA. The supernatant is re-extracted four 
25 times and phase separated each time by centrifugation at 2,000 x g for 
10 minutes. The RNA is predpiated by adjusting the sample to 0.25 M 
NaCl, adding 2 volumes of 100% ethanoV and storing at -20°C. The RNA 
is pelleted at 5,000 x g for 30 minutes, is washed with 70% and 100% 
ethanol, and 1s then dried. Polyadenylated (polyA + ) messenger RNA 
30 (mRNA) is obtained from the total cytoplasmic RNA by chromatography on 
oligo dT cellulose (Aviv, J.» et al., Proc. Natl. Acad. Sci. (1972) 
69:1408-1412), The RNA is dissolved in ETS (10 mM Tris, 1 mM EDTA, 
0.5% SDS, pH 7.5) at a concentration of 2 mg/ml . This solution is 
heated to 65°C for five minutes, then quickly chilled to 4°C. After 
35 bringing the RNA solution to room temperature, it is adjusted to 0.4 M 
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NaCl and 1s slowly passed through an oligo dT cellulose column 
previously equilibrated with binding buffer (500 mM NaCl, 10 m Iris, 
1 mM EDTA, pH 7.5 0.5% SDS). The flow- through Is passed over the 
column twice more. The column 1s then washed with 10 volumes of 
5 binding buffer. PolyA + mRNA Is eluted with allquots of ETS, extracted 
once with TE-saturated phenol chloroform and 1s precipitated by the 
addition of NaCl to 0-2 M and 2 volumes of 100% ethanol . The RNA 1s 
repredpitated twice, 1s washed once 1n 70% and then in 100% ethanol 
prior to drying. 

10 Total. mRNA was subjected to 5-20% by weight sucrose gradient 

centrlfugatlon in 10 mM Tris HC1, pH 7.4, 1 mM EDTA, and 0.5% SDS 
using a Beckman SW40 rotor at 20°C and 27,000 rpm for 17 hours. The 
mRNA fractions were then recovered from the gradient by ethanol 
precipitation, and injected into Xenopus laevis oocytes 1n the 

15 standard translation assay. The oocyte products of the RNA fractions 
were assayed in the bone marrow proliferation assay of Moore, R. N. et 
al., J. Immunol. (1983) 131:2374, and of Prystowsky, M. B., et al. f 
Am» : J. Pathol . (1984) 114; 149 and the fractions themselves were 
assayed by dot blot hybridization to a 32-mer probe corresponding to 

20 the DNA 1n the second exon of the genomic sequence (exon II probe). 
These results are summarized in Figure 1A and B. 

The broken line in Figure 1A shows the response 1n the bone 
marrow proliferation ass^y of the supernatants from the Xenopus laevis 
oocytes; Figure IB shows the dot blot results. The most strongly 
25 hybridizing fraction, 11, corresponds to greater than 18S, while the 
most active fractions 8 and 9 correspond to 14-16S. 

The mRNA was also fractionated on a denaturing formaldehyde 
gel, transferred to nitrocellulose, and probed with exon II probe. 
Several distinct species ranging in size from 1.5 kb to 4.5 kb were 
30 found, even under stringent hybridization conditions. 

B. Isolation G-CSF mRNA From MIA PaCa-2 Cells 

Confluent MIA PaCa-2 cells were stimulated In serum free 
Dulbecco's minimum essential medium (DMEM) for 4 days with phorbol 
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my ri state acetate (50 ng/ml ) and retlnolc acid (10 yM). The RNA was 
prepared as described by Chlrguin et al.; briefly, the cells were 
lysed In 5 M guanldlne Isothiocyanate followed by centrlf ligation 
through a 5.7 M cesium chloride (CsCl) cushion; poly A + RNA was 
5 prepared from the lysed cells by one selection cycle on oligo (dT) 
cellulose as described in Maniatls et al., supra. 

In more detail, cells were lysed In a solution which 
contained 5 M guanldlne isothlocyanate, 0.025 M Na-c1trate, pH 7, 0.5% 
sarcosine and 8% B-mercaptoethanol . Molecular biology grade CsCl was 
10 made up to 5.7 M or 40% w/v and buffered with 0.02 M Tr1s pH 7.5 and 
0.002 M Na-EDTA. All solutions were prepared under RNase free 
conditions and passed through 0.45 y Millipore filters before use. 
The lysed cells were then layered onto SW28 ul tracentrlfuge tubes 
which contained layers of 10 ml 5.7 M CsCl and 6 ml 40% CsCl. 

15 After centrlf ugatlon at 25,000 rpm for 18 hours, the RNA 

pellet was dissolved in dH 2 0 and ethanol precipitated twice. 
Polyadenylated (PolyA + ) messenger RNA (mRNA) was obtained by 
chromatography of the total RNA on oligo (dT) cellulose as described 
1n Manlatis, supra (at page 197). 

20 The RNA is dissolved 1n sterile H 2 0 and heated to 65°C for 

five minutes. An equal volume of a solution containing 0.040 M Tris 
CI pH 7.6 1.0 M NaCl, 0.002 H EDTA and 0.2% SDS is quickly added and 
the sample is cooled to room temperature. The sample is then loaded 
on an oligo (dT) column that has been equilibrated with a buffer 

25 containing 0.020 M Tris pH 7.6, 0.5 M NaCl, 0.001 M £DTA and 0.1% 
SDS. The flow through is collected, heated to 65°C, cooled to room 
temperature and passed over the column once more. The column 1s then 
washed with 10 volumes of wash buffer (0.02 Tris, 0.1 M NaCl, 0.001 M 
EDTA, 0.1% SDS). Poly A + mRNA 1s eluted with aliquots of 0.01 H Tris 

30 pH 7.5, 0.001 M EDTA and ethanol -precipitated twice. 

334 yg of MIA PaCa-2 Poly A + mRNA was fractionated to 5-25% 
by weight sucrose gradient centrifugation in 20 mM Tris HC1, pH 7»5, 1 
nM EDTA, and 0.5% sarcosine using a Beckman SW40 rotor at 20°C and 
27,800 rpm for 16 hours. The mRNA fractions were collected in 400 yl 
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fractions and ethanol precipitated twice* Fractions were pooled and 
resuspended in 15 pi H 2 0. 

Morthern blots were prepared by the electrophoresis of 5 yg 
of poly A + RNA per lane or 1 yl of each pooled RNA fraction through 1* 
5 agarose gels containing 0.5 M formaldehyde followed by blotting onto 
nitrocellulose filters- After baking the filters for 1.5 hours at 
80°C, they were prehybrldlzed (5 x SSC, 10 x Denhardt's solution (0.2* 
polyvinylpyrrolidone. 0.2% Ficoll, 0.2% BSA), 0.1% SDS, 50 nW sodium 
phosphate pH 7.0, and 100 p g/ml tRNA) for one hour at 55°C. The blots 

10 were hybridized for 16 hours at 55°C 1n a similar solution that also 
contained 10% dextran sulfate and 10 6 cpm per ml of an oligonucleotide 
probe labeled with r 32 P-ATP and polynucleotide kinase. The blot shown 
in Figure 3B was washed at 55°C in 3 x SSC, 0.1% SDS. This 
oligonucleotide probe had the sequence 5 1 -GTAGGTGGCACACAGCTTCTCCTG-3 1 

15 and was designed based on the sequence of the CHU-2 cDNA clone 
described by Nagata et al., Nature, 319: 415-418 (1986). 

The RNA pools that were described in the above assay were 
translated in a Xenopus laevis oocytes assay by the injection of 50 nl 
of of RNA into each oocyte as described in Gurdon et al., Nature , 
20 233 :177-180 (1971). Supernatants of 10 yl per oocyte were collected 
after 40 hours and assayed for CSF activity. 

The Xenopus laevis supernatants from each hybridization 
fraction were assayed for CSF activity in a murine bone marrow cell 
proliferation assay as described in Moore, et al., J. Immunol . . 

25 131: 2374-2378 (1983). Briefly, in this assay, 5 x 10 4 murine bone 
marrow cells per/well were incubated in 96-well plates (12 x 8) with 
serially diluted Xenopus laevis oocyte supernatants made from 
positively hybridizing mRNA fractions. After three days, 3 H thymidine 
(0.5 uCi/well) was added, and after six hours the cells were harvested 

30 and counted in a liquid scintillation counter. 

Peak bone marrow proliferation was found in Xenopus laevis 
oocyte supernatants made with mRNA fractions that were most strongly 
positive 1n Northern blots with the above-described oligonucleotide 
probe as shown 1n Figure 3. 
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C. Identification of G-CSF Clones 1n a HIA PaCa-2 cDNA Library 

A cDNA library was prepared from the enriched MIA PaCa-2 
mRNA as described In Kawasaki, et al.^ Science , 30:291-296 {1985). 
Briefly, the method used ollgo (dT) priming of the poly A + tails and 

5 AMV reverse transcriptase employing the method of Okayama, H. et a!., 
Hoi. Cell Biol. , _3: 280-289 (1983) This method results in a higher 
proportion of full length clones than does poly (dG) tailing and 
effectively uses as host vector portions of two vectors therein 
described, and readily obtainable from the authors, pcDVl and pLl* 

10 The resulting vectors contain the insert between vector fragments 
containing proximal BamH I and Xhol restriction sites; the vector 
contains the pBR322 origin of replication, and ampicillin resistance 
gene and SV40 control elements which result in the ability of the 
vector to effect expression of the inserted sequences in COS-7 cells. 

15 A 1.2 x 10 6 clone library in E. coll obtained from the above 

enriched MIA PaCa-2 mRNA by the Okayama and Berg method was then 
probed using the same oligonucleotide probe that yielded a positive 
signal on the most active pooled MIA PaCa-2 mRNA fractions. To probe 
the library, E. coli containing the Okayama-Berg vectors were grown up 

20 on nutrient medium. Colonies were lifted onto nitrocellulose filter 
papers and were lysed. DNA was fixed to the filter by treatment for 
five minutes with 0.5 mM NaOH t 1.5 M NaCl . Filters were then washed 
twice for five minutes each time with 1.5 M Tris pH 8, 3 M NaCl and 
were air dried and baked at 80°C for two hours. 

25 The filters for the screening were prehybridized and 

hybridized to the y 32 P labeled probe as described above in Section B 
of the example, but both prehybridization and hybridization were 
carried out at 50°C. Plasmids pP12 and pP28 were determined to be 
probe positive and were further characterized. 

30 D. Sequencing of G-CSF MIA PaCa-2 Plasmids cDNA 

pP12 was digested with BamH I and subcloned into a M13mpl9 
vector and sequenced using the dideoxy chain-termination method. 
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The DNA sequence and predicted protein sequence of pP12 are 
shown In Figure 4. The cDNA Insert 1n pP12 1s 1510 base pairs long 
excluding the poly (dG) and poly (dA) tails; 1t contains 11 more bases 
of 5' untranslated sequence than the CHU-2 G-CSF clone. The major 
5 difference between this clone derived from MIA PaCa-2 and CHU-2 cDNA 
clones of Nagata et al. is a 9 base pair Insertion (GTGATGGAG) in the 
CHU-2 clone that would encode the amino acid residues Val-Ser-Glu just 
prior to cys-36 1n the MIA PaCa-2 G-CSF as Indicated by the arrow in 
Figure 4. There are two other differences; an A at position 588 in 
10 the MIA PaCa-2 clone (G in the CHU-2 clone) 1s a silent third base 
change, and a T at position 1237 in the MIA PaCa-2 clone (C in the 
CHU-2 clone) 1s in the 3' untranslated region. 

E. Activity of the pP12 and pP28 Proteins Produced in COS-7 Cells 

Plasmids pP12 and pP28 were purified using a CsCl gradient 
15 and COS-7 cells were transfected using a modification {Wang, A. M. , et 
al., Science , 228 :149 (1985)) of the calcium phosphate coprecipl tation 
technique. After Incubation for three days, supernatants were 
collected and G-CSF production was assayed in a colony forming assay 
and the murine bone marrow proliferation assay, as described above. 

20 CSF activity was measured by both murine bone marrow cell 

proliferation and colony formation assays. For the colony assay, 1 x 
10 5 mononuclear bone marrow cells from Balb/c mice were used in a 
total of 1 ml , containing 5% cell supernatant, 15% fetal calf serum, 
and 0.3% agar 1n RPMI 1640 medium. Colonies of greater than 40 cells 

25 were counted after seven days, and cell type determined by 
cytocentrifuglng individual colonies and staining with a modified 
Wright stain. Each unit of CSF stimulates the formation of one 
colony. For a control, COS-7 cells were mock transfected under the 
same conditions by carrying the cells through the transfection steps 

30 without exposure to a plasmid. The results of the assay are described 
below 1n Table I. 
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TABLE 1 







Colony Assay 




Transfectlon 


Proliferation (cpm) 


Units/ml 




*GM b 


5 pP12 


4595 


>280 


58 


42 


P P28 


6594 


>280 


83 


17 


Mock 


633 


0 


0 


0 



a = Colonies containing only granulocytes. 

b = Colonies containing both granulocytes and macrophages. 

10 EXAMPLE II 

Cloning S-CSF for Expression 1n E. coll 

A. Preparation of G-CSF Sequence: Insertion of 5' Hindi II 
and ATG Codon 

In order to facilitate manipulation of the G-CSF DNA 

15 sequence in various cloning and expression vectors, the G-CSF sequence 
was altered by inserting a Hindlll site and ATG codon 1n mG-CSF 3' of 
the GCC codon encoding alanine (-1). 

. Plasroid pP12 was digested with BamH I endonucTease and the 

digest was electrophoresed in a Tris-acetate, 0.5% low-melting point 

20 agarose gel. The small fragment band carrying the G-CSF DNA sequence 
was melted and ligated into BamHI-digested M13mpl9 using T4 DNA Ugase 
under sticky end conditions. After ligation, the mixture was used to 
transform E. coll strain DG98, ATCC accession number 39,768. The 
transformed cells were plated in the presence of 0.3 mM 

25 isopropylthlogalactoside (IPTG) obtained from Sigma Chemicals (St. 
Louis, MO) and 0.3 mM X-gal on a lawn of DG98 and grown at 37°C. Non- 
alpha complementary white plaques were grown in liquid broth and a 
sample of the culture was used to purify replicative form (RF) DNA. 
The presence of the insert was confirmed by BamH I digestion and sizing 

30 the fragment on a 0.7% agarose gal. Phage having the expected insert 
were designated pA3. 
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A Hind lll site and ATG Initiation codon were placed in the 
G-CSF sequence adjacent to and In frame with the codon encoding the N- 
terminal threonine of the mG-CSF sequence by means of site-specific 
mutagenesis using a chemically synthesized purified 33-mer 
5 oligodeoxyribonucleotide having the following sequence: 

5'-GTG CAG GAA GCC AAG CTT ATG ACC CCC CTG GGC-3\ 

The Hindlll site immediately upstream of the ATG codon 
permits the convenient cloning of the mG-CSF-encoding DNA sequence in 
subsequent manipulation. Approximately 10 pmoles (pH) of the 33-mer 

10 was hybridized to about 1 pM of single stranded DNA from pA3 in about 
13 w l of 10 mM Tris pH 7.4, 90 mM NaCI , 10 mM MgCl 2 by heatirtg to 85°C 
for five minutes, followed by 45°C for 20 minutes. The annealed 
mixture was chilled on ice and adjusted to 18 v l by the addition of 
dithlothrietol to 10 mM, each dXTP to 0.5 nfl and 5 units of DNA 

15 Polymerase I Klenow fragment. The reaction mixture was incubated on 
ice for 20 minutes followed by incubation at room temperature for one 
hour. The repair reaction mixture was then used to transform E. coli 
strain DG98 as above, plated onto agar plates and incubated overnight 
to obtain phage plaques. 

20 Plaques were blotted using nitrocellulose filter circles and 

the filters were treated as described to lyse the cells, denature the 
DNA, neutralize, rinse, fix the DNA to the filter and incubate in 
prehybridization buffer. The 33-mer oligonucleotide was end labeled 
with y 32 P using polynucleotide kinase and hybridized to the filters at 

25 45°C overflight. Filters were washed and subjected to 

autoradiography. Probe positive plaques were, grown in liquid culture 
and characterized by BamHI Hindlll digestion and agarose gel sizing to 
confirm the presence of the expected fragments. One clone having the 
desired Insert was designated pApooL. pApooL was digested with BamH I 

30 and Hindlll. The approximately 1500 base pair BamHI-Hindlll fragment 
was isolated, purified, and retained for ligation into a plasmid 
having the desired promoter and terminator sequences.. 
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B. Placement of G-CSF Coding Sequence Under Trp Control 

To place the G-CSF coding sequence under Trp control and 
under the Influence of the BT positive retroregul atory element (PRE), 
the G-CSF coding sequence was cloned Into plasmid pAW703. pAW703 1s a 
5 plasmid having the Trp promoter and PRE 1n operable linkage with a 
Hind i I I -BamH I fragment which contains the gene encoding human tumor 
necrosis factor (TNF). 

As described below in detail, pAW703 is digested with BamH I 
and Hind lll and the BamH I and Hindi I I fragment of pApool encoding G- 
10 CSF is li gated in place of the TNF f ragmen t. 

The following describes the construction of pAW703 and the 
cloning of G-CSF into this plasmid to make pPDl. An alternate 
construction using pTrp3 is also described* 

B.l. Construction of pAW703 

15 Plasmid pAW703 was made from plasmids pAW701 and pAW711 as 

follows. Plasmid pAW711 (ATCC 39,918) which contains a 400 bp BamHI- 
Sail fragment encoding the PRE was digested with Sal I and BamH I 
endonucleases under conditions suggested by the supplier to yield an 
approximately 400 bp fragment containing the PRE. The digest was 

20 loaded on a 1% agarose preparatory gel and the small Sal I-BamH I 
fragment containing the PRE was isolated, electroeluted and retained, 

Plasmid pAW701 was used to obtain the Trp control sequences 
for regulating expression of the gene* 

pAW701, a derivative of plasmid pTrp3, was made by digesting 
25 pTrp3 with BamH I and Hind i II endonucleases and ligation of the large 
Hind i I I -BamH I fragment with Hind i 1 1 -BamH I fragment carrying the TNF 
gene. In the construction of pAW703, pAW701 was digested with BamH I 
and Sail endonucleases. The large BamH I -Sal I fragment of pAW701 
containing the Trp promoter Hind i II site and TNF sequence was Isolated 
30 by electroelution from a preparative agarose gel, and was 11 gated to 
the small BamH I -Sall fragment containing the PRE, under sticky end 
conditions using T4 DNA ligase to produce pAW703. 
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Both pAW703 and pApool were digested with BamH I and Hindi 1 1 
under conditions suggested by the manufacturer. The large Hindlll- 
BamHI fragment from pApooL was gel isolated and ligated with the 
pAW703 digest using T4 DNA llgase. 

The ligation mixture was used to transform competent E. coll 
strain MM294. Transformants were screened for amplcillln 
resistance. Plasmlds from a number of amplcillln resistant colonies 
were Isolated, digested with Hlndlll and BamH I and run on a 1.5% 
agarose gel with DNA sizing standards. One clone having the expected 
size fragments was designated pPDl. 

B.2. Trp Expression Vectors 

pTrp3 is a host expression vector containing a Trp promoter 
and RBS Immediately 5' of a Hindi 1 1 restriction site, thus permitting 
Insertion of a coding sequence having a Hindi 1 1 site immediately 5' of 
a start codon under control of the Trp promoter. The backbone vector 
for P Trp3 is pBR322. P Trp3 was deposited with the American Type 
Culture Collection, 12301 Parklawn Drive, Rockville, HD (ATCC) on 
December 18, 1984, and has accession number 39,946. 

B.2.A. Preparation of pTrp3 

To construct the host vector containing the Trp control 
sequences behind a Hindlll site, the Trp promoter/operator/rlbosome 
binding site sequence, lacking the attenuator region, was obtained 
from pVH153, obtained from C. Yanofsky, Stanford University. Trp 
sequences are available In a variety of such plasmlds known In the 
art. pVH153 was treated with JJhal (which cuts leaving an exposed 3' 
sticky end just 5' of the Trp promoter) blunt ended with Klenow, and 
partially digested with TapJ. The 99 bp fragment corresponding to 
restriction at the TaoJ site, 6 nucleotides preceding the ATG start 
codon of Trp leader were isolated, and then 11 gated to EcoRl 
(repair )/ClaI digested, pBR322 to provide pTrp3. 

An improved form of pTrp3 also includes a positive 
retroregulatory sequence in a position 1n the vector downstream of the 
coding sequence to be expressed as described herelnbelow. 
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An expression vector for G-CSF having the same control 
sequences as pAW703 may be made directly from pTrp3 (ATTC 39,940) and 
pAW711 (ATCC 39,918) and pP12 as follows. 

pTrp3 is digested with Hindlll and Sail . The large Hindi II- 

5 Sail fragment is Isolated by electroelution from a preparative gel. 
The small BamHI-Sal I fragment containing the PRE is isolated from 
pAW711 as described above and ligated to the large Hindlll-Sall 
fragment of pTrp3 under sticky end conditions with T4 DNA ligase. 
pPDl may also be made directly by combining the BanHI-Hlndlll fragment 

10 of pP12 with the above-mentioned fragments from pTrp3 and pAW711 and 
11 gating under sticky end conditions with T4 DNA ligase. 

C. Placing a BamHI Restriction Site Downstr eam of the G-CSF 
translation Termination Site 

The DNA sequence encoding G-CSF in pA3 contained 
15 approximately 930 nucleotides in the transcribed but untranslated 
region 3" of the translation stop codon TGA 1n the G-CSF sequence. In 
order to remove these untranslated codons and to place transcription 
termination and mRNA stability under the influence of the positive 
retroregulatory element, a BamH I site was placed by site-specific 
20 mutagenesis immediately 3* of the G-CSF translation termination 
signal. 

A chemically synthesized purified 27-mer deoxyribonucleotide 
was chemically synthesized using a commercial DNA synthesizer and had 
the following sequence: 
25 CCC AGC CCT GAG GAT CCA AGC CCT CCC 

Approximately 10 pM of the 27-mer was hybridized to about 1 
pM of single-stranded DNA from PpA3 in about 13 w l of 10 mM Tr1s pH 
7.4, 90 mM NaCl, 10 mM MgCl 2 by heating to 85°C for five minutes, 
followed by 45°C for 20 minutes. The annealed mixture was chilled on 
30 ice and adjusted to 18 yl by the addition of dithiothreitol to 10 mM, 
each dXTP to 0.5 mM and 5 units of DNA Polymerase I Klenow fragment. 
The reaction mixture was incubated on 1ce for 20 minutes, followed by 
incubation at room temperature for about one hour. 
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The reaction mixture was then used to transform coll 
strain DG98, plated onto agar plates and Incubated overnight to obtain 
phage plaques. Plaques were blotted using nitrocellulose filter 
circles arid the filters were treated as described to lyse the cells, 
denature the DNA, neutralize, rinse, fix the DMA to the filter and 
Incubate 1n prehybrldlzatlon buffer. The 27-mer oligonucleotide was 
end labeled with phosphate using polynucleotide kinase and 

hybridized to the filters at 45°C overnight. Filters were washed and 
subjected to autoradiography. One of the probe positive plaques was 
designated pBpool. 

pBpool was grown In liquid culture and digested with Apal 
and BamH I endonucleases under conditions specified by the 
manufacturer- The coding region of G-CSF has an internal Apal site. 
This digestion generated an Apal -BamHI fragment of approximately 540 
base pairs, with the newly Inverted BamH I end Immediately 3' of the 
translation stop codon of the mG-CSF gene. 

Plasmid pPDl was also digested with Apal and BamH I under 
conditions suggested by the supplier. This digestion yielded an Apal- 
jJamHI fragment In which the Apal site was the same as that described 
above and the BamH I site 1s 3' of the untranslated region of the G-CSF 
gene. 

The two digests were mixed at a molar excess of digested 
pBpool to pPDl and ligated under sticky end conditions using T4 DNA 
ligase. The ligation mixture was transformed into E. coli strain 
MM294. Transformants were screened for ampidll 1n resistance. 
Several colonies were isolated and digested with Hindi I I and BamHI. A 
colony having the correct length Hindlll-BamHI insert was designated 
pPD2. 

D. Placing mG-CSF Under P, Control 

Plasmid pLAPHg 1s a plasmid having the P L promoter and gene 
N-RBS controlling the expression of the Aphe gene. The Aphe gene Is 
contained 1n a " Hindlll-BamHI fragment. The plasmid further contains 
the positive retroregulatory element 3' to the Aphs gene with a BamHI 
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site directly 5' of the PRE. Plasmid plAphf* was made from plasmld 
pFC54.T (ATCC number 39.739). Plasmld pLAPHB 1s exactly the same as 
pFC54.T except that the latter plasmld has a Hindi I I -BamHI fragment 
encoding des-alanyl-IL-2 1n place of the Aphp encoding fragment. 

5 Thus, the following steps to place the expression of 6-CSF under 
temperature sensitive P L promoter-gene N-RBS and PRE control can also 
be carried out using pFC54.T. The Hindi I I -BamHI fragment of pPD2 
having the ATG Initiation codon was placed under ? L promoter control 
and PRE influence as follows: 

L0 Plasmid pLAPHB was digested with Hind i I I and BamH I under 

conditions suggested by the supplier. Plasmld pPD2 was digested with 
the same endonucl eases. The digests were combined and li gated with T4 
DNA ligase with the pPD2:pLAPHB digests at a 10:1 molar ratio. 

After ligation, the ligation mixture was used to transform 

15 competent E. coli host strain XNDG95. Ampicillin resistant colonies 
were selected and plasmids were isolated therefrom, digested with 
BamHI and Hind lll and a transformant having the expected insert of 
approximately 540 bp was designated pJDL 

EXAMPLE III 
20 Expression of roG-CSF in pPD2 and pJDl 

A. G rowth and Induction of pPD2 in Transformed E. coli Strains 
MM294 and pJDl in Transformed E. coli in xN DS95 

Initial attempts to produce mG-CSF using the two different 
expression vectors were not successful. The first vector, pJDl, 

25 {Figure 5) consisted of the DNA encoding the mature mG-CSF protein 
under the control of the bacteriophage x P L promoter and the X gene N- 
RBS. The Bacillus thuringiensis positive retroregulating element 
(PRE) 1s immediately downstream of the mG-CSF coding region. In 
addition, the origin of replication, derived from ColEl, contained 

30 mutations that conferred a temperature-sensitive Cop" phenotype. 

coli cells, strain DG95x (x N 7 N 33 C1837 sus P80), harboring pJDl were 
grown at 30°C to an optical density of 0-3 at 600 nm. The P L promoter 
and plasmid replication were then induced by shifting the temperature 
to 42 °C for three hours. 
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The second vector pPD2> consisted of DNA encoding the mG-CSF 
protein under the control of the E. coli Trp promoter- The Bacillus 
thurlngiensis positive retroregulating element was Immediately 
downstream of the mG-CSF coding region. E. coll cells, strain MM294, 
5 were grown overnight 1n M9 medium plus 100 v g/ml of Trp and 50 g g/nrt 
of amplcitlin. In the morning the cells were washed twice with M9 
medium and the Trp promoter was then Induced by setting up cultures at 
ah optical density of 0.05 at 600 nm in M9 medium without Trp, The 
cells were grown at 37 °C to an optical density of 0.8 at 600 nm. 

10 After induction of pJDl and pPD2, extracts of induced and 

unlnduced cells were prepared by boiling the cells for five minutes 1n 
Laemmli loading buffer (LaemmH, Mature, 227:680-685 (1970)) which 
contains sodium dodecyl sulfate (SDS) and 2-mercaptoethanol (2ME ). 
After electrophoresis through 12.5% polyacryl amide gels (PAGE), 

15 containing SDS and 2ME, and staining with Coomassie brilliant blue, an 
induced protein band of the molecular weight corresponding to mG-CSF 
could not be detected. 

B. Steady State mG-CSF mRNA Level After Induction of Cells 
Harboring pJDl and pPD2 ~ " 

20 To determine the steady level of G-CSF transcripts in cells 

harboring pJDl and pPD2, RNA was prepared as follows: 20 ml of 
Induced or unlnduced cells was mixed with 20 ml of ice cold phosphate 
buffered saline. The cells were pelleted by centrifugation at 4°C and 
resuspended in 3 ml of 0.15 M sucrose-20 nrt sodium acetate. 0.3 ml of 

25 a 20 mg/ml lysozyme solution was added and the mixture was Incubated 
on ice. 0.15 ml of 20* SDS was added and the mixture was extracted 
with equal volumes of phenol .-chloroform (1:1) until the Interface was 
clear (about 4 times). After two ethanol precipitations, the nucleic 
acid mixture was resuspended 1n 100 yl of 20 Tris (pH 7.5J-10 mM 

30 MgCl 2 . Two microliters of a 1 mg/ml solution of RNase-free DNase I 
was added and the mixture was Incubated at 37°C for 10 minutes. Five 
ul of 20% SDS was added and the samples were then phenol :chloroform 
(1:1) extracted twice and ethanol precipitated 2 times. 
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The Northern blotting analysis was performed as described 
using a modification of a published protocol (Hanlatls et al.. 
Molecular Cloning:* Laboratory Manual. Cold Spring Harbor, N.Y. 
(1982)). Briefly, Northern blots were prepared by the electrophoresis 
5 of RNA through 11 agarose gels containing 0.5 M formaldehyde followed 
by blotting onto nitrocellulose filters. After baking the filters for 
1.5 hours at 80 e C, they were prehybridized for one hour at 45°C 1n: 5 
x SSC, 0.2* polyvinylpyrrolidone, 0.2% Flcoll, 0.2* bovine serum 
albumin, 0.1% SDS, 50 mM sodium phosphate pH 7.0, and 100 ug/ml 

10 tRNA. The blots were hybridized for 16 hours at 45 C C in a similar 
solution that also contained dextran sulfate (10*) and 10 6 cpm per ml 
of the mG-CSF specific oligonucleotide S'-CGCTGCGCCATCGCCCTGOATCTT-a' 
that had been labeled with Y 32 P ATP and polynucleotide kinase. The 
blots were washed in 3.2 M te trame thy 1 ammonium chloride for 15 minutes 

15 at 62°C. The bands corresponding to mG-CSF mRNA were then detected by 
autoradiography. 

Figure 10 shows that induced cells harboring pJDl contained 
a mG-CSF mRNA of the expected size as well as smaller mG-CSF related 
mRNA while cells harboring pP02 did not contain detectable mG-CSF 
20 mRNA. 

EXAMPLE IV 

A - Construction of a P, Vector with Alternative mG-CSF Codons 

Two oligodeoxyrlbonucleotides were chemically synthesized 
and purified and had the following sequences: a = 5'- 
25 AGCTTATGACACCATTAGGAC-3 ' , B = 5 1 -TAATGGTGTCATA-3 1 . 

DNA of pJDl was digested with the restriction enzymes 
Hlndlll and Agal and, after electrophoresis, the large fragment was 
isolated from a Tris-acetate, 0.5* low-melting-point agarose gel 
(Crease et al., Methods 1n Enzymology . 101; 78 (1983)). This fragment 
30 was mixed with 20 pmole of each of the above two oligonucleotides. 
This mixture was llgated with T4 DNA Hgase. and used to transform 
competent E. coli cells of the strain DG95. The cells were plated on 
agar plates containing ampicllHn and colonies containing recombinant 
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plasmlds (pJD4A and pJD4B) were Identified by colony hybridization 
(Maniatis et al. supra ) using oligonucleotide A which had been labeled 
with y 32 P ATP and polynucleotide kinase. As shown 1n Figure 11, pJD4A 
and pJD4B, both encode the same protein as pJDl; however, codons 2, 3, 
5 and 4 of pJD4A and pJD4B are different from the ones used in pJDl. In 
addition, codon 5 1n pJD4B also differs from pJDl. This difference 
between pJD4A and pJD4B results from the fact that the end of the 
oligomer duplex that was ligated to the Apal site contained a single 
mismatch to the single-stranded end generated by the enzyme Apal . 

10 This type of mismatch will not interfere with ligations {Hung and 
Wenslnk, Nucleic Acids Research . ^12:1863-1874 (1984)); however, 
depending on the repair of this mismatch, some of the resulting clones 
will regenerate the Apal site and have a pJDl-Uke 5th codon and some 
will lose the Apal site and have a pJD4B-11ke 5th codon. The pJD4A 

15 and pJD4B constructions were verified by DNA sequence analysis. 

B - Expression of mG-CSF Protein by pJD4A and pJD4B 

— £2l± harboring either pJD4A or P JD4B were Induced and 
analyzed by SDS-PAGE as described for pJDl In Example III. Figure 12 
shows that an Inducible protein, with a molecular weight corresponding 
to that of mG-CSF, Is present 1n E. coll cells that harbor either 
pJD4A or pJD4B. 



20 



C " Construction of a P, Vect or With Alternative Codons Enco ding «n 
Amino Metmoni ne Peptidase hrocessable Amino Terminus of MK-CSF 

Two oligonucleotides were chemically synthesized and 
25 purified and had the following sequences: 

C: 5 ' AGCTTATGCCATTAGGAC3 1 
D: 3' ATACGGTAAT 5' 

DNA of plasmld pJDl was digested with restriction enzymes 
Hlndlll and Ap_al and the large fragment was Isolated as described 1n 
30 IV.A. above. 20 pM each of oligonucleotides C and D were mixed with 
the large fragment, ligated with T 4 DNA ligase under sticky end 
conditions, and the ligated mixture was used to transform competent E. 
coli strain DG95. Cells were plated on agar containing ampicllUn an7 
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Amp resistant colonies containing a recombinant pi asmid were 
identified by hybridization with oligonucleotide C, which had been 
labeled with y 32 P ATP using polynucleotide kinase. The plasraid 
obtained from the probe positive colonies was designated pJD6 and 
5 encoded a G-CSF species in which the thrj 1s deleted, I.e., ^-mG- 
CSF. The correct sequence was confirmed by dideoxy sequencing. 

D. Expression of mG-CSF Protein by pPD6 

E. coli DG95 harboring pPD6 was Induced and analysed as 

described for pJDl in Example III. 
10 Figure 15 is a picture of an SDS-PAGE gel of proteins 

produced by E. coli harboring pPD6 and pJD5. The cell clearly shows 

that pPD6 produce an Inducable protein with a slightly smaller 

molecular weight than MG-CSF produced by pJD5. 

The protein band produced by pPD6 was isolated and purified 
15 from a large scale Induction of pPD6. The purified protein was 

sequenced using an automated amino acid sequencer and showed that the 

NH 2 -tenninus of this protein 1s pro rather than met or thr. 

EXAMPLE V 

A. Construction of a Trp Expression Vector Containing Alternative 
20 mG-CSF codons 

DNA from pJD4A was digested with the restriction enzymes 
Hlndlll and BamHI . The small fragment, containing the coding region 
for mG-CSF, was isolated from Tris-acetate 0.5% low-melting-point 
agarose gel. DNA from pPD2 was digested with the restriction enzymes 

25 Hindi II and BamH I and the large fragment was isolated from a Tris- 
acetate 0.5% low-melting-point agarose gel* These two fragments were 
ligated together with T4 DNA ligase and used to transform competent E. 
coli of the strain MM294. An ampicillin resistant colony containing 
the recombinant was picked and the construct was verified by DNA 

30 sequence analysis. The resulting plasmid was designated pPD5. 
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B. Expression of mG-CSF In pPDS 

E. col 1 harboring pPDS was Induced and analyzed by SDS-PAGE 
as described for pPD2 1n the previous example. Figure 13 shows that 
an Inducible protein, with a molecular weight corresponding to that of 
5 mG-CSF, was present in E. coli cells that harbor pPDS. 

C. Using Alternative mG-CSF Codons Increase s the Steady State Level 
of mG-CSF mftNA 

The steady state level of mG-CSF mRNA 1n induced cells 
harboring either pPD2 or pPD5 was compared by Northern analysis as 
10 described above. Figure 14 shows that induced cells harboring pPD5 
contain a mG-CSF transcript of the expected size, while no mG-CSF 
transcripts were detected 1n induced cells harboring pPD2. 

The following plasmids have been deposited in the American 
Type Culture Collection, Rockville, MO USA: 



15 Plasmid Strain Date Accession # 

PJD4A E. coli K12 DG95X 8/12/86 67,181 

p<JD4B T7 coTT K12 DG95x 8/12/86 67,183 

pPDSA TT7 coTT K12 MM294 8/12/86 67,182 

pAWll 17 coTT K12 DG95X 11/8/84 39,918 

20 P Trp3 rT coTT MClOOO 12/18/84 39,946 

pFC54.t TT coTT DG95x 8/7/84 39,789 



These deposits were made under the provisions of the 
Budapest Treaty on the International Recognition of the Deposit of 
Microorganisms for the Purposes of Patent Procedure and the 

20 Regulations thereunder (Budapest Treaty)* This assures maintenance of 
a viable culture for 30 years from date of deposit. The deposits will 
be made available by ATCC under the terms of the Budapest Treaty, and 
subject to an agreement between applicants and ATCC which assures 
permanent and unrestricted availability upon Issuance of the pertinent 

25 U.S. patent. The Assignee herein agrees that if the culture on 
deposit die or be lost or destroyed when cultivated under suitable 
conditions, 1t will be promptly replaced upon notification with a 
viable speciment of the same culture. Availability of the deposits is 
not be be construed as a license to practice the invention in 
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contravention of the rights granted under the 
authority of any government in accordance with its 

patent laws. 

The invention includes the use of G-CSF as 
defined above or a mutein as defined above in 
preparing a medicament, e.g. in preparing a 
pharmaceutical or veterinary formulation. 

In another aspect the invention includes a 
pharmaceutical or veterinary formulation comprising 
G-CSF as defined above or a mutein as defined above 
formulated for pharmaceutical or veterinary use 
respectively, optionally the formulation being in 
unit dosage form and/or also comprising an acceptable 
diluent, carrier or excipient. 

The inventors have disclosed G-CSF and means 
for the efficient expression thereof. Variations 
within the scope of the invention may be made by 
those ordinarily skilled in the art without departing 
from the essence of the invention as claimed herein. 
The illustrative embodiments given hereinbefore are 
not limiting. 
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CLAIMS: 

1. A modified DNA sequence encoding G-CSF, said G-CSF 
having the effect of stimulating the production of primarily 
granulocyte or granulocyte-macrophage colonies 1n colony forming assay 
5 using bone marrow cell progenitors of an appropriate species, wherein 
the 5 1 region thereof comprises deoxyHboadenine substituted for the 
debxyriboguanine or deoxyrlbocytoslne 1n the third position of at 
least one codon wherein said substitution does not change the amino 
acid sequence of G-CSF. 

10 2. The DNA sequence according to claim 1 wherein said 5 1 

region comprises a DNA sequence encoding the amino add sequence Thr 
Pro Leu Gly or Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro. 

3. The DNA sequence according to claim 2 wherein at least 3 
codons have said substitution. 

!5 4. The DNA sequence according to claims 1, 2 or 3 further 

comprising deoxyri bo thymidine substituted for deoxyrlbocytoslne in the 
first position of at least one codon. 

5. A modified DNA sequence encoding mature G-CSF wherein 
the 5' region thereof comprises codons selected from the group 

20 consisting of ACA CCA TTA GGA, ACA CCA TTA, ATG ACA CCA TTA GGA and 
ATG ACA CCA TTA. 

6. A modified DNA sequence encoding a mutein of G-CSF, said 
mutein of G-CSF having the effect of stimulating the production of 
primarily granulocyte or granulocyte-macrophage colonies in colony 

25 forming assays using bone marrow cell progenitors of an appropriate 
species, wherein the mutein does not have an NH 2 -tenninal methionine 
when produced from a recombinant host. 
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7. The modified DNA sequence encoding a muteln of G-CSF of 
claim 6 wherein the NH£- terminal end of said muteln encodes a sequence 
that is processed by met amino peptidase. 

8. The modified DNA sequence encoding a mute in of G-CSF of 
5 claim 7 wherein the DNA sequence encodes an amino terminal end 

selected from the group consisting of met-ala, met-gly, met-pro, met- 
al a-met, met-gly-met, met-ala-ser, met-ala-pro, met-pro-thr, and met- 
pro-leu. 

9. An expression vector comprising the DNA sequence of 
10 claims 1 to 8 in operable linkage with a promoter and ribosome 

binding site. 

10. The expression vector of claim 9 in operable linkage 
with a promoter-ribosome binding site selected from the group 
consisting of P L promoter and ribosome binding sites operable 

15 therewith and Trp promo ter-ribosome binding site. 

11- An E. coli host transformed with the expression vector 
of claim 9 or claim 10. 

12. A culture of E* coli transformed with the expression 
vector of claim 10 having a G-CSF content of at least about 3% of 

20 total cell protein to about 10% of total cell protein. 

13. Human recombinant G-CSF. 

14. An mRNA fraction Isolated from human cells that encodes 
human G-CSF. 

15. The expression vector of claim 10 comprising pJD4 or 

25 pPD5. 
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16. A muteln of G-CSF, said mutein of G-CSF having the 
effect of stimulating the production of primarily granulocyte or 
granulocyte-macrophage colonies 1n colony forming assays using bone 
marrow cell progenitors of an appropriate species wherein said mutein 
does not have an NH 2 - terminal methionine. 

17* The mutein of claim 16 wherein the NB 2 -terminal end of 
said mutein has an amino add sequence that 1s processed by met amino 
peptidase In a recombinant host. 

18. The mutein of G-CSF of claim 17 wherein the amino 
terminal end is selected from the group consisting of met-ala, met- 
gly, met-pro, met-ala-met, met-gly-met, met-ala-ser, met-ala-pro, met- 
pro-thr and met-pro-leu. 

19. The mutein of G-CSF of claim 18 which yields Vj-mG-CSF 
when expressed 1n a recombinant host. 

20. A mutein of roG-CSF selected from the group consisting 
of alaj-mG-CSF, glyj-mG-CSF, prOj-mG-CSF, prOj-thrg-mG-CSF, ala^metg- 
mG-CSF, gly 1 -met 2 -mG-CSF, alaj-serg-mG-CSF, alaj-pro 2 -mG-CSF f proj- 
leu 2 -mG-CSF and v^mG-CSF. 

21. Vj-mG-CSF. 
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22. A method of producing a modified DNA 
sequence encoding G-CSF, said G-CSF having the effect 
of stimulating the production of primarily 
granulocyte or granulocyte-macrophage colonies in 
colony forming assay using bone marrow cell 
progenitors of an appropriate species, which method 
comprises substituting deeoxyriboadenine at the 5' 
region of the sequence for the deoxyriboguanine or 
deoxyribocytosine in the third- position of at least 
one codon wherein said substitution does not change 
the amino acid sequence of G-CSF. 

23. A method according to claim 22 and 
further defined by the specific feature(s) of any one 
or more of claims 2 to 4. 

24. A method of making a modified DNA 
sequence encoding mature G-CSF which includes 
selecting codons for the 5' region thereof from the 
group consisting of ACA CCA TTA GGA, ACA CCA TTA, ATG 
ACA CCA TTA GGA and ATG ACA CCA TTA. 

25. A method of producing a modified DNA 
sequence encoding a mutein of G-CSF, said mutein of 
G-CSF having the effect of stimulating the production 
of primarily granulocyte or granulocyte-macrophage 
colonies in colony forming assays using bone marrow 
cell progenitors of an appropriate species, wherein 
the mutein does not have an NH^-terminal methionine 
when produced from a recombinant host, comprising 
modifying the NH 2 -terminal sequence encoding 
portion of a DNA sequence encoding G-CSF, for example 
by subjecting cDNA encoding native G-CSF to site 
specific mutagenesis to produce an NH^-terminal 
sequence wherein the NH 2 -terminal methionine is 
susceptible to enzymatic cleavage or ligating an 
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oligonucleotide encoding such an NH 2 -terninal 
sequence to a specifically digested G-CSF DNA 
sequence* 

26. A method according to claim 25 and 
further defined by the specific feature of claim 7 or 
of claim 8. 

27. A process for preparing an expression 
vector which comprises forming an operable linkage 
between a promoter and ribosome binding site and a 
DNA sequence as defined in any one of claims 1 to 8, 
the promoter -ribosome binding site preferably being 
selected from the group consisting of promoter 
and ribosome binding sites operable therewith and Trp 
promoter-ribosome binding site. 

28. A method of making an E. coli cell 
capable of expressing G-CSF which comprises 
transforming an E. coli host with an expression 
vector as defined in claim 9, claim 10 or claim 15. 

29. A process for the preparation of 
recombinant G-CSF which comprises culturing an E. 
coli transformed host as defined in claim 11 or a 
culture as defined in claim 12 or E. coli cells made 
by the method of claim 28 so as to permit expression 
of the E. coli genome. 

30. A process according to claim 29 wherein 
the G-CSF prepared thereby is as defined in any one 
of claims 16 to 21. 

31. A pharmaceutical or veterinary 
formulation comprising G-CSF as defined in claim 13 
or a mutein as defined in any one of claims 16 to 21 



- 46 - 



0256843 



formulated for pharmaceutical or veterinary use 
respectively, optionally the formulation being in 
unit dosage form and/or also comprising an acceptable 
diluent, carrier or excipient. 

32. The use of G-CSF as defined in claim 13 
or a mutein as defined in any one of claims 16 to 21 
in preparing a medicament, e.g. in preparing a 
pharmaceutical or veterinary formulation as defined 
in claim 31* 
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FIG. 4 

1 AAAAAACAGCCCGGAGCCTGCAGCCCAGCCCCACCCAGACCC ATGGCTGGACCTCrr.xrr 

HetAlaGlyProAlaThr 
-30 

6 1 CAGAGCCCCATGAAGCTGATGGCCCTGCAGCTGCTGCTGTGGCACAGTGCACTCTGGACA 

GlnSerProMetLysLeuMetAlaLeuGlnLeuLeufceuTrpHisSerAlaLeaTrpThc 
-20 -10 

121 GTGCAGGAAGCCACCCCCCTGGGCCCTGCCAGCTCCCTGCCCCAGAGCTTCCTGCTCAAG 
ValGlnGluAlaThrProLeuGlyProAlaSerSerLeuProGlnSerPheLeuLeuLys 
-11 10 

1 81 TGCTTAGAGCAAGTGAGGAAGATCCAGGGCGATGGCGCAGCGCTC CAGGAGAAGC' T> GTG T 

CysLeuGluGlnValArgLysIleGlnGlyAspGlyAlaAlaLeuGlnGluLysLeuCvs 
x 20 30 

241 GCCACCTACAAGCTGTGCCACCCCGAGGAGCTGGTGCTGCTCGGACACTCTCTGGGCATC 

AlaThrTyrLysLeuCysHisProGluGluLeuValLeuLeuGlyHisSerLeuGlylle 
40 50 

301 CCCTGGGCTCCCCTGAGCAGCTGCCCCAGCCAGGCCCTGCAGCTGGCAGGCTGCTTGAGC 

ProTrpAlaProLeuSerSerCysProSerGlnAlaLeuGlnLeuAlaGlyCysLeuSer 
60 70 

361 CAACTCCATAGCGGCCTTTTCCTCTACCAGGGGCTCCTGCAGGCCCTGGAAGGGATCTCC 
GlnLeuHisSerGlyLeuPheLeuTyrGlnGlyLeuLeuGlnAlaLeuGluGlylleSer 
80 90 

421 CCCGAGTTGGGTCCCACCTTGGACACACTGCAGCTGGACGTCGCCGACTTTGCCACCACC 
ProGluLeuGlyProThrLeuAspThrLeuGlnLeuAspValAlaAspPheAlaThrThr 
100 110 

4 81 ATCTGGCAGCAGATGGAAGAACTGGGAATGGCCCCTGCCCTGCAGCCCACCCAGGGTGCC 

IleTrpGlnGlnHetGluGluLeuGlyMetAlaProAlaLeuGlnProThrGlnGlvAla 
120 130 

541 ATGCCGGCCTTCGCCTCTGCTTTCCAGCGCCGGGCAGGAGGGGTCCTAGTTGCCTCCCAT 

MetProAlaPheAlaSerAlaPheGlnArgArgAlaGlyGlyValLeuValAlaSerHis 
140 ISO 

6 01 CTGCAGAGCTTCCTGGAGGTGTCGTACCGCGTTCTACGCCACCTTGCCCAGCCC^GAGCC 

LeuGlnSerPheLeuGluValSerTyrArgValLeuArgHisLeuAlaGlnPro" 
160 170 

661 AAGCCCTCCCCATCCCATGTATTTATCTCTATTTAATATTTATGTCTATTTAAGCCTCAT 

7 21 ATTTAAAGACAGGGAAGAGCAGAACGGAGCCCCAGGCCTCTGTGTCCTTCCCTGCATTTC 
7 81 TGAGTTTCATTCTCCTGCCTGTAGCAGTGAGAAAAAGCTCCTGTCCTCCCATCCCCTGGA 
li} S^SS ?. AGG A ^ AG G T ^AAT ACCAAGTATTTATTACT ATG ACTGCTCC C CAGCC C^GG C 
901 TCTGCAATGGGCACTGGGATGAGCCGCTGTGAGCCCCTGGTCCTGAGGGTCCCCACCTGG 

i S5i ?^" CTGAGAGTAT( ^ GGTCTC CCACGTGGGAGACAAGAAATCCCTGTTTAATATTTAA 
}° n 2 o\ ACAGCAGTGTT CCCCATCTGGGTCCTTGCACCCCTCACTCTGGCCTCAGCCGtcTGCAct 

1081 gcggcccctgcatccccttggctgtgaggcccctggacaagcagaggtggccaga?c-g2 
1141 gaggcatggccctggggtcccacgaatttgctggggaatctcgtttttcttcttaagact 
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